STOCHASTIC MULTIPLE CHOICE KNAPSACK ASSORTMENT OPTIMIZER 



CROSS-REFERENCE TO RELATED APPLICATIONS 
The present application claims priority from U.S. Provisional Application No. 
60/255,624, titled "Stochastic multiple choice knapsack assortment optimizer," filed 
December 13,2000. 

TECHNICAL FIELD 

The present invention relates to data processing generally, and more particularly, 
to data processing for resource optimization. 

BACKGROUND ART 
Combinatorial problems and various discrete solution systems are well known in 
the art of operations research and the art of optimization generally. (T. H. Cormen, C. E. 
Leiserson and R. L. Rivest, Introduction to Algorithms, MIT Press, Massachusetts, 1990; 
R. Fletcher, Practical Methods of Optimization, 2nd edition, John Wiley and Sons, 
Chichester, 1987; Hillier, F. S., and Lieberman, G. J., Introduction to Operations 
Research, McGraw-Hill, New York, 1995; R. G. Parker and R. L Rardin, Discrete 
Optimization, Academic Press, Boston, 1988; Handbook of Combinatorial Optimization, 
vol. 1, D.-Z. Du and P. M. Pardalos, eds., Kluwer Academic Publishers, Drodrecht, 
1998.) 

In many business applications, however, combinatorial optimization presents 
substantial challenges in that the computational resources and time required to develop an 
acceptable solution are typically unacceptable. Several approaches to providing a rapid 
solution to a combinatorial problem include the use of parallel digital processors to solve 
such problems as articulated in US patent 5,537,593 issued to FMC Corporation™ on 
July 1 6, 1 996. IBM™ Corporation has received numerous patents on various methods 
of optimizing inventory and materials requirements planning (MRP), which are basically 
resource allocation problems, such as US patent 5,216,593 issued June 1, 1993. Others 
have received patents on neurocomputers suitable for solving an optimum combination 
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problem. Nevertheless, the technical problem of developing a highly efficient and cost 
effective system and method for solving hard combinatorial problems remains. 

In many business applications, traditional attempted solutions to technical 
combinatorial problems involving resource allocation rely on periodic static data, 
acquired after-the-fact, are based on cumbersome, dueling spreadsheets, and require 
extensive time to run scenarios of multiple resource allocation options. What is desired is 
an improved system and method to solve these technical combinatorial problems to 
achieve optimal resource allocation. 

SUMMARY OF THE INVENTION 

In one embodiment of the present invention, a method of determining allocations 
in a business operation to maximize profit includes: collecting profit data for a plurality 
of classes in the business operation, where each class includes an allocation having a cost 
function and each allocation belongs to the group consisting of physical allocations and 
economic allocations; determining profit functions for the allocations from the profit 
data; formulating a Multiple Choice Knapsack Problem to maximize profit from the 
profit functions, the cost functions, and a cost constraint; and solving the Multiple Choice 
Knapsack Problem to determine values for the allocations. 

According to one aspect, determining the profit functions includes: determining 
demand distributions for the allocations from the profit data; and determining each profit 
function from a corresponding demand distribution. The demand distributions may 
depend on probability distributions such as Poisson models, normal distribution models, 
and Markov models according to the requirements of the operational setting (e.g., 
Poisson models and normal distribution models for non-replacement or fashion retail and 
Markov models for replacement or commodity retail). 

According to another aspect the spatial allocations may include spatial allotments 
(e.g., widths of shelves). The method may be further specialized so that all allocations 
are physical allocations. 

According to another aspect, the economic allotments may include monetary 
allotments (e.g., dollar amounts). The method may be further specialized so that all 
allocations are economic allocations. 



According to another aspect, the cost constraint may be further specified as a 
greater-than-or-equal-to inequality, an equality, or a less-than-or-equal-to inequality. 

In another embodiment of the present invention, a system for determining 
allocations in a business operation to maximize profit, includes: a data unit, a profit- 

5 model unit, and an optimization-engine unit. The data unit has a memory that includes 
profit data for multiple classes in the business operation, where each class includes an 
allocation having a cost function that is stored in the memory. The memory also 
including a cost constraint. The profit-model unit, which is connected to the data unit, 
includes executable instructions for determining profit functions for the allocations from 

10 the profit data. The optimization-engine unit, which is connected to the data unit and the 
profit-model unit, includes executable instructions for formulating a Multiple Choice 
Knapsack Problem to maximize profit from the profit functions, the cost functions, and 
the cost constraint, and for solving the Multiple Choice Knapsack Problem to determine 
values for the allocations. This embodiment of the present invention may include aspects 

1 5 described above. For example, the allocations may be exclusively physical allocations or 
exclusively economic allocations or some combination thereof. 

In another embodiment of the present invention, computer-readable media 
tangibly embodies a program for determining allocations in a business operation to 
maximize profit. The program including executable instructions for: collecting profit 

20 data for a plurality of classes in the business operation, where each class includes an 

allocation having a cost function; determining profit functions for the allocations from the 
profit data; formulating a Multiple Choice Knapsack Problem to maximize profit from 
the profit functions, the cost functions, and a cost constraint; and solving the Multiple 
Choice Knapsack Problem to determine values for the allocations. This embodiment of 

25 the present invention may include aspects described above. For example, the allocations 
may be exclusively physical allocations or exclusively economic allocations or some 
combination thereof. 

The present invention enables optimal resource allocation in a variety of business 
operations to maximize profit while constraining cost, where resources may include 

30 economic and physical allocations, and where profit and cost may be defined in a number 
of ways. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a view of different combinations of unit quantities, cost of units, 

and profit from units associated with assortment optimization performed by an 

embodiment of the invention. 

Figure 2 shows an exemplary process overview of an embodiment of the 

invention. 

Figure 3 shows an exemplary view of multiple profit curves generated for a set of 

items. 

Figure 4 shows an exemplary solution to a generalized problem associated with an 
embodiment of the invention. 

Figure 5 shows other exemplary business problems that may be solved by an 
embodiment of the invention. 

Figure 6 shows an exemplary flowchart of an assortment optimization 
embodiment of the invention. 

Figure 7 shows an exemplary item attribute table associated with an embodiment 
of the invention. 

Figure 8 shows an exemplary item demand and price table associated with an 
embodiment of the invention. 

Figure 9 shows an exemplary system attribute table associated with an 
embodiment of the invention. 

Figure 10 shows an exemplary season attributes table associated with an 
embodiment of the invention. 

Figure 1 1 shows an exemplary item choice table associated with an embodiment 
of the invention. 

Figure 12 shows an exemplary constraint table associated with an embodiment of 
the invention. 

Figure 13 shows an exemplary optimal solution table associated with an 
embodiment of the invention. 



DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS . 

1 . Invention Overview 

The present invention provides a method and system for performing stochastic 
optimization, and in particular, assortment optimization including the determination of 
physical and economic allocations to maximize profit in a business operation. In the 
following description, numerous details are set forth in order to enable a thorough 
understanding of the present invention. However, it will be understood by those of 
ordinary skill in the art that these specific details are not required in order to practice the 
invention. Further, well-known elements, devices, process steps and the like are not set 
forth in detail in order to avoid obscuring the present invention. 

Figure 1 shows an optimization problem in spreadsheet form according to an 
embodiment of the present invention. For example, a retail buyer in a business operation 
may have an assortment choice for stocking three products: item A 102, item B 104, and 
item C 106. For each of these items, the buyer may choose how many units to buy 108 
for which there is a corresponding cost 1 1 0 and a corresponding profit 1 12. Typically 
the cost data is imposed on the buyer, for example, by a distributor. The data for the cost 
for each item may be a simple linear function of the number of units as shown for items 
A, B, and C, or the cost function may reflect other factors such as a discount for volume 
purchases. 

Analogously, the data for the profit for each item may be a simple linear function 
of the number of items as illustrated by the profit column for Item C 106. More 
realistically, however, the profit data is a complex result of multiple factors including 
demand, competition and overhead costs, whether static or seasonal. Thus, for Item B 
104 the profit levels at a saturation value of three units, and for Item A 102 the profit 
actually decreases at a breakpoint value of four units. Realistic estimates for profit values 
may require substantial empirical analysis of market trends or at least a sensitivity to 
seasonal fluctuations. 

A cost constraint 1 14 is imposed as a budget limit on the total cost of all the items 
bought. Then Figure 1 illustrates the problem of choosing how many units to buy for 
each of three items 102, 104, 106, to maximize the total profit where the total cost is 



constrained. That is, for each item the buyer chooses the number of units to buy (i.e., 
from each item's units-to-buy column) to maximize the total profit (i.e., a sum of values 
from each item's profit column) subject to the constraint 1 14 on the total cost (i.e., a sum 
of values from each item's cost column) . For the example shown is Figure 1, there are 
6 =216 possible choices (i.e., from 0 to 5 for each item) and the optimal result 1 16 is 
highlighted. 

The optimization problem illustrated in Figure 1 has the structure of a Multiple 
Choice Knapsack Problem (MCKP). Typically the MCKP is formulated as: 

maximize £5>,*, 

k 

subject to £2>^<c (1-1) 

2X=i i=i,...,£, 

jeN, 

x 9 e{0,l}, i=l,...,k,jeNi 

In Eq. (1-1) there are k classes Nj (i=l k) of items corresponding to the three 

items 104, 106, 108 in Figure 1. Each j e N t corresponds to a row index for an item in 
Figure 1 . The allocation variable x y has the value 1 if and only if the allocation from row 
j is chosen for the class analogously to the highlighted solution 1 16 in Figure 1. The 
parameters p ijt w 9 , and c in (PI) can likewise be identified with the entries in Figure 1. 
The profit py cm be identified with the profit columns in Figure 1, the weight w v can be 
identified with the cost columns in Figure 1, and the capacity (or cost) c can be identified 
with the cost constraint 1 14 in Figure 1 . Thus, the example shown in Figure 1 can be fit 
generally into the class of Multiple Choice Knapsack Problems. In general, such 
problems are computationally difficult to solve as characterized by the complexity level 
NP-hard. However, as discussed in greater detail below, practically useful methods can 
be implemented to solve realistic problems in many operational settings. 

Figure 2 shows an embodiment according to the present invention for solving the 
optimization problem shown in Figure 1. Profit data 1 18 is received by a Profit Model 



120 and used to determine profit functions 122, which may be identified with the-profit 
columns 1 12 in Figure 1, and cost functions, which may be identified with the cost 
columns 1 10 of Figure 1. An optimization engine 128 receives the profit functions 122, 
the cost functions 124, and additionally a cost constraint 124, which may be identified 
with the cost constraint 1 14 of Figure L The optimization engine 128 determines 
allocations to maximize profit 128 (e.g., by solving Eq. (1-1)), which maybe identified 
with the optimal solution 1 16 shown in Figure 1 (i.e., the allocations of units to buy). 

Further details associated with the components of the embodiment shown in 
Figure 2 are presented below. 

2. Profit Models 

Profit models 120 of varying complexity may be used to determine profit 
functions 122 according to the present invention. Additionally cost functions 124 are 
also obtained, often as an intermediate step in the determination of the profit functions 
122 or as a direct (and simpler) calculation from the profit data 1 18. These models may 
rely on simple formulas that are easily implemented by spreadsheet as well as more 
complex stochastic modeling principles. For instance, an exemplary simple profit model 
A may use an invariant demand of D units for an item. Model A may use the assumption 
that a seller will sell any quantity of items that is purchased up to and including D units. 
Any units purchased over D would remain unsold. Profit for model A may be defined as: 

Profit = Sales Income - Cost Of Goods. (2- 1 ) 

Here the elements of Eq. (2-1) may be further defined by simple formulas that are easily 
implemented in a spreadsheet: 



Sales Income 
Cost Of Goods 



(2-2) 



Increasingly complex profit models may also be used according the requirements 
of an operational setting. Probability distributions may be tailored to specific 
applications. For example, profit model B may use a fluctuating demand (e.g., 
units/week ) that may be characterized by a probability distribution function having a 
mean equal to M and a variance equal to V. profit model B may calculate unit sales as an 
expected value over many time periods and define profit to include other factors as: 

Profit = Sales Income - Cost Of Goods - Freight - Carrying Cost - Other Costs. (2-3) 

As another example, an exemplary profit model C according to an embodiment of 
the invention describes the total demand (in number of units) of an item over the full life 
cycle of the item as a Poisson distribution with mean = Then, the probability that the 
demand will exactly equal N units is given by the probability density function of the 
Poisson distribution evaluated at \i and N. (i.e, probability = PDF[|a,N]). Then if a buyer 
purchases a quantity of items equal to "Supply", then the expected number of units sold 
is: 



From the value of UnitSales, other values including the Profit can be determined. 
For example, if the selling price of a single unit of the item is equal to UnitPrice, the sales 
income from the unit sales is given as: 



Likewise, if the purchase cost of a single unit of the item is equal to UnitCost, the cost of 
purchasing M units (e.g., M=Supply) of the item is given as: 



Supply~\ co 



UnitSales= Y j N*PDF\ju,N]+ ^Supply •PDF\}i,N] 



(2-4) 




Saleslncome = UnitPrice * UnitSales 



(2-5) 



InventoryCost = UnitPrice * M 



(2-6) 
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Then profit can be defined in terms of these quantities as: , 

Profit = Saleslncome - InventoryCost (2-7) 

5 Eqs. (2-4) - (2-7) illustrate an embodiment of a profit model 1 20 that uses a 

probability distribution (e.g., a Poissson distribution) for the demand to create profit 
functions 122. Preferably the profit model 120 generates an independent demand 
distribution function for each class (or item), possibly by stochastic analysis using 
representative probability distributions as discussed above. Then from each demand 
10 distribution an independent profit function can be determined as discussed above. Figure 
3 shows an exemplary view of multiple profit curves generated for three distinct items 
900, 902, 904. Then in the optimization engine 128 combinations of items 900, 902, 
904 and quantities 906, 908, 910 are evaluated against a constraint entered by the user, 
and the optimizer engine returns the solution generating the optimal gross profit. (Note 
that Figure 3 represents the profit functions as continuous curves although the 
optimization engine 128 takes discrete quantities as inputs as in the columns of Figures 1 
and 3.) 

The use of probability functions to model demand distributions in the profit model 
120 may be tailored to specific operational settings and the corresponding availability of 

20 data. For example, the Poisson and normal distributions can be especially advantageous 
in the context a non-replenishment retail (or fashion retail) where the demand 
characteristics do not reflect continual replenishment. In many cases inventory is 
replenished as needed over a long period of time. Examples of this type of item include 
commodities such as aspirin and toothpaste. In these cases, as items are sold, an 

25 automated replenishment system typically places an order that initiates shipment of 

additional items to the store. Once the items arrive at the store, a stock clerk places some 
or all of them on a shelf in the sales area of the store. Customers arriving at random times 
deplete the on-shelf supply causing the replenishment system to begin the cycle again by 
ordering additional items. Since items of this type are frequently displayed in designated 

30 fixed-width areas on a shelf, the retailer must decide how much width to allocate to each 
item that goes on the shelf. In some cases the best decision is to allocate shelf width equal 
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to the width of a single unit of the item. In other cases the retailer may wish to dedicate 
enough width so that two or more units may be placed beside each other on the shelf. In 
retail terminology this is referred to as deciding how many "facings" of an item will be 
placed on the shelf. In this scenario, the profit model calculates the profitability of each 

5 item as a function of the number of facings that are included on the shelf. More 

specifically the profit model calculates the profitability as a function of the shelf width 
occupied by the facings. 

In the literature there are examples of modeling the inventory replenishment 
process by using Markov chains. (Hillier, F. S., and Lieberman, G. J., Introduction to 

10 Operations Research, McGraw-Hill, New York, 1995.) For example, if the state of the 
system is defined as the number of units of an item that are on the shelf immediately 
following a restock cycle, a Markov transition matrix for a Markov chain can be used to 
model the demand and on-shelf supply between restocking the item. By calculating the 
stationary probabilities of this matrix, one can determine the probability of finding any 

1 5 given number of units of the item on the shelf. This piece of information, combined with 
the probability density function of the demand for the time interval between restocking 
cycles, allows one to calculate the expected sales of the item, and then its profitability. 

An advantage of using of Markov chains is that one can calculate the profitability 
very quickly and accurately on a computer. In some situations, however, the behavior of 

20 the shelf re-supply process may be sufficiently complex so that it is easier to create a 
profit model from a simulation (e.g., a Monte Carlo simulation). In this type of 
simulation, one can use an appropriate random number generator (e.g. Poisson) to create 
a series of random numbers that represent the demand for an item in the interval between 
restocking cycles. The simulation then steps through this series of numbers and, at each 

25 step, takes the action that an actual replenishment system and stock clerk would have 
taken in the situation. Through this process one can simulate how many units of the item 
are on the shelf between restocking cycles. By comparing this to the random numbers 
that represent the demand, one can calculate the expected sales of the item, and then its 
profitability. To get more accurate results, one can run this simulation repeatedly and 

30 then average the results. 
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Consistently with the above discussion, the spreadsheet characterization of profit 
functions and cost functions shown in Figure 1 can be generalized as illustrated by 
Figures 4 and 5. Analogously to the example shown in Figure 1, Figure 4 shows an 
exemplary solution to a generalized problem associated with an embodiment of the 
invention. For example, one generalized problem presents two attributes 508, 510 
(Attribute 1 and Attribute 2) and a given a number of classes 500, 502, 504, 506 such that 
an embodiment of the invention selects exactly one choice per class such that: 

1 . The sum of Attribute 2 5 1 0 for all classes 500, 502, 504, 506 is a 
maximum or a minimum to meet an objective 512, for example, maximization of 
Attribute 2; and 

2. The sum of Attribute 1 508 for all classes 500, 502, 504, 506 satisfies 
some constraint 514 such as <, >, =, >, or < a limit. 

For example, using Figure 4, and the constraint being Attribute 1 < 10,500, the 
circled choices show the sum of Attribute 2 being maximized, wherein Attribute 2 is 
profit. 

An embodiment of the optimization engine 128 then provides a solution to 
scenario-based retail problems, such as assortment planning and floor space allocation. 
The embodiment determines the most profitable answer for a given situation (e.g., 
potential items for the assortment) and its constraint (e.g., OTB ("Open To Buy") dollars 
or floor space). The present invention enables a simultaneous solution to the 
optimization problem and uses a comprehensive profit definition which may be based on 
a probability-based profit model. The invention also enables modeling the fluctuating 
nature of demand and selecting the assortment with optimal profit, for example, using 
OTB dollars as the constraint. 

Figure 5 shows other exemplary business problems that may be solved by an 
embodiment of the invention. For example, an embodiment solves a business problem 
that is associated with a main constraint 608, a class 602, and two attributes, Attribute 1 
604 and Attribute 2 606. Attribute 2 606 may be, for example, item profit. An 
embodiment of the invention then creates a customized profit model 120 associated with 
the specific details of the business problem, 
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Figure 5 illustrates a number of possible embodiments tailored to specific 
business problems that may arise in a variety of business operations. The business 
problem may be associated with a dimensionally constrained assortment optimization 
610, the main constraint may be total width, and the first attribute may be an item width. 
5 The business problem may be associated with a budget constrained assortment 

optimization 612, the main constraint may be associated with a total budget, and the first 
attribute may be an item cost. The business problem may be associated with advertising 
space allocation 618, the constraint may be associated with a total area of advertisement, 
and the first attribute may be an item advertisement area. The business problem may be 

1 0 associated with catalog space allocation 620, the constraint may be associated with a total 
area of catalog pages, and the first attribute may be an item advertisement area. The 
business problem may be associated with a start of season budget allocation 614, the 
constraint may be associated with a total budget available for merchandise, and the first 
attribute may be an item cost. The business problem may be associated with retail floor 

1 5 space optimization 6 1 6, the constraint may be associated with total floor space, and the 
first attribute may be item width. The business problem may be associated with loan 
portfolios 622, the constraint may be associated with a total loan budget, and the first 
attribute may be an item loan amount. The business problem may be associated with 
merchandise allocation to stores from a warehouse 624, the constraint may be associated 

20 with total units in the warehouse, and the first attribute may be units shipped to the stores. 

3. Optimization Engines 
3.1 MCKP Solvers 

As discussed above with reference to Figure 1, the optimization engine 128 
25 determines the allocations by solving an MCKP, where the classical formulation is given 
as: 

maximize 

k 

subject to ZX>^.<c (3-1) 

1=1 jsN, 
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JX =1 i-=l,...,k, 

x g e{0 9 l} 9 i=l,...,kj eNi 

In Eq. (3-1), k classes #/,... iV* of items are given to pack into a knapsack of 
capacity (or cost) c. Each item j e N t may have a corresponding profit p s and a we^/i/ 
w ij9 and the problem is to choose one item from each class such that the profit sum is 
maximized subject to some constraint on the weight sum. Typically these parameters are 
assumed to be integers, but these assumptions can be relaxed for many optimization 
contexts (e.g., to include real-valued parameters) depending on the application setting as 
well as the solution method. Here it is generally assumed that the weight values Wy are 
non-negative and the capacity c value is also non-negative. The profit values p i} may be 
positive, negative or zero. These assumptions may also be relaxed under some 
conditions. (D. Pisinger and P. Toth, "Knapsack Problems", in Handbook of 
Combinatorial Optimization, vol. 1, D.-Z. Du and P. M. Pardalos, eds., Kluwer Academic 
Publishers, Drodrecht, 299-428, 1998.) 

In (3-1) the constraint appears as a less-than-or-equal-to inequality so this variant 
may also be denoted as MCKPL. An alternative version of the problem denoted as 
MCKPE imposes the constraint as an equality constraint: 

k 

maximize 

k 

subject to £ E w 9 x y = c (3-2) 

1=1 jsN, 

E x y = 1 i = \, . . . ,k, 
x v e{0,l}, i=l,...,k,j sNi 

Additionally, a third alternative version of the problem denoted as MCKPG 
imposes the constraint as a greater-than-or-equal-to inequality: 
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maximize £2>„x, 

subject to XI>^> C (3-3) 

IX =1 i=l,...,A, 
yew, 

^€{0,1}, i=l i.y'eJVi 

In most operational settings the preferred formulation of MCKP is MCKPL, 
where the cost constraint appears as an upper bound (e.g., a hard budgetary or spatial 
limit). As an alternative formulation, MCKPE is sometimes preferred where all 
resources must be utilized or exhausted. And finally MCKPG may arise for example in 
cases where the cost appears as a lower threshold of utilization. 

In an exemplary embodiment, the optimization engine 128 includes an MCKP 
solver to determine the solution 130 to the allocation problem defined by the profit 
functions 122, the cost functions 124 and the cost constraint 126. The MCKP 
formulations presented above fall into a larger class of problems known as integer 
programming problems, and a variety of solution techniques are possible including both 
exact and non-exact methods. (R. Fletcher, "Practical Methods of Optimization", 2nd 
edition, John Wiley and Sons, Chichester, 1987; R. G. Parker and R. L Rardin, Discrete 
Optimization, Academic Press, Boston, 1988.) In the context of discrete optimization, 
MCKP is NP-Hard as it contains KP (i.e., the Knapsack Problem) as a special case. 
Some methods exploit the particular structure of the MCKP formulation, particularly in 
the standard form given by Eq. (3-1 ). (D. Pisinger, "A Minimal Algorithm for the 
Mulitple-Choice Knapsack Problem", European Journal of Operational Research (83) 2, 
pp 394-410, 1995; D. Pisinger and P. Toth , "Knapsack Problems", in Handbook of 
Combinatorial Optimization, vol. 1, D.-Z. Du and P. M. Pardalos, eds., Kluwer Academic 
Publishers, Drodrecht, 299-428, 1998) 

The remaining subsections of this section present an embodiment of a method for 
solving MCKP based on dynamic programming to provide an exact solution in pseudo- 
polynomial time. 
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3.2 Exact solution of MCKPE 

This subsection presents a dynamic programming method for providing the 
optimal solution to MCKPE. Also techniques for finding the optimal solution vector are 
discussed. The next subsection (3.3) presents improvements to the dynamic 
programming algorithm; for example, space consumption may be reduced from 0(nc) to 
0(c) at almost no cost in the solution time. Also several techniques are presented for 
decreasing the constants in the running time 0(nc). The next subsection (3.4) shows how 
other variants of the problem may be addressed, including solving MCKPL and MCKPG 
by means of the method for solving MCKPE. The next subsection (3.5) shows 
computation results that illustrate the practical performance of the method and 
summarizes some of its advantages. 

Since MCKPE is A/P-hard, exact solution methods generally must rely on some 
enumeration techniques, where dynamic programming is generally preferred since it 
gives a tight bound on running time. In the method shown here, dynamic programming is 
used to solve MCKPE directly. 

First it is shown that MCKPE has an optimal substructure property, in which an 
optimal solution consists of optimal subsolutions. That is, if x m = } is an 

optimal solution to MCKPE, then the subsolution x {h) = {*.. } w h for any h = 0, ... , 
k is an optimal solution to the restricted MCKPE problem defined on the classes 1, . . . , 
h and with capacity d = c - V w-ffl 

The proof is by contradiction. Assume that x ik) ={fy} H k jsN is an optimal 
solution to MCKPE, but that the subsolution Jc (A) = {*.. } w h for any h = ft . . . , k is 
not an optimal solution to the problem 
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maximize 

k 

subject to ]T X w ij x u = c 



(3-4) 



Then, there must exist a solution 3c (A) = \x y } j e AT. , to (3-4) that yields a 

better solution value. By setting 5t~ = jc iy for i = h + 1, . . . , kj e one has constructed 

a complete solution jc ( * } to MCKPE. This solution 3c (/,) is feasible, since in comparison 
to x {k) that was a feasible solution, different items in classes 1, . . . , h may be chosen that 
had an identical weight sum. Moreover, this solution x {k) will have abetter objective 
value than x (k) since a subset of the solution x {k) is replaced with items that have a larger 
profit sum. This contradicts the original assumption that x {k) is an optimal solution to 
MCKPE and proves the result. 

Knowing the property of optimal substructure, one can derive a dynamic 
programming algorithm. L&flh, d) be an optimal solution to the MCKPE problem 
defined on classes 1, . . . , h with capacity d, i.e. 

h 

maximize 



/(M)= 



«=1 yeATs 
h 

subject to YsY, w v x ij = d 

i=i j*N, 

5X =1 i = l,...,h 

JeN, 



(3-5) 



h e {0,1}, 



i=l,...,k,j eN r 



With the assumption that/(%,<^=-oo if no solution exists to (3-5), one obtains the 
following recursion: 

f(h,d) = msx(f(h-l,d-w v )+p hi ) for/i = 1, . . .,k,d = 0, ...,c (3-6) 
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where it is assumed that the expression inside the parenthesis returns - oo if d - wy < 0. 
For h = 0 there is only one possible solution to (3-5), namely, f{0, 0) = 0. For all other 
capacities d = 0, . . . , c, one sets/0, d) = - oo so as to indicate that no solution exists. 

Thus to solve MCKPE, the recursion (3-6) may be used to fill a table J(h, d} of 
dimensions (* + 1) x (c + 1). Since each evaluation of (3-6) takes OfrifJ time, one obtains 
the time complexity o(c£* =o n,.)= 0{cn). The space complexity is 0{kc) due to the 
dimension of table/ The optimal solution value may be found as z =fjc, c), where a 
value of - oo indicates that the weight sum c cannot be obtained by any combination of 
items from the classes. Not only is the optimal solution for capacity c obtained, but in 
addition, the tabled, •) contains all optimal solutions for capacities 0, . . . , c. 

The recursion (3-6) determines the optimal solution value for all capacities 0, . . . , 
c but it does not give any information about the corresponding solution vector. But 
having the table fih, d) permits one to use backtracking techniques to determine the latter. 
Thus it is possible to determine the solution vector for capacity c, i.e. the vector 
corresponding toJ[k, c). To find out which item was chosen from the last class it, the 
following equation is tested: 

f(k -l, c -Wy)+ p v = f(k, c) (3-7) 

for all values of/' e N k . The equality must be satisfied for some value of/' e N k smczftk, 
c) was obtained by performing the reverse operation in (3-6). Assume that / is the 
proper item. Then, = 1 , and that all other values of % should be set to 0. Moreover, 
let d = c- Wy. , be the new capacity. Then the same process may be repeated in a 
recursive way, now looking at class k - 1 with capacity d. 

The running time of the algorithm is 0(n) because for each class h = k, . . . , 1 
equation (3-7) is tested for ally e N t thus taking time o{^ h=x n)= 0(n). According to 

this approach, however, the entire tabled, d) is stored to derive the optimal solution 
vectors. 
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33 Improved dynamic programming algorithm 

Although recursion (3-6) may be used in its straightforward form, the complexity 
may become excessive for problems with many classes and/or large capacities. In 
particular the space complexity may quickly become a bottleneck when capacities are 
large (e.g. c = 1 000 000). Several techniques improve space and time consumption of 
the method, as follows: 

• All calculations can be made within a single array of length c + L 

• The solution vector may be derived in a very space-efficient way. 

• Only values of *f(h 9 d) for specific values of it need to be evaluated in recursion 
(3-6). 

• Sorting of classes according to their weight span may improve the practical 
performance. 

• Dominance in classes removes items that will not lead to an optimal solution. 

• Loop unrolling may be used to improve caching. 

(a) Space Reduction 

A variant of the method reduces the requirements for space. In terms of space 
reduction, because the recursion (3-6) derives entries//*, ) from//*-/, •) there is only a 
need to store the two most recently considered columns of the table, thus decreasing the 
space complexity to 2(c +1). 

To decrease the space consumption further, in order to evaluate//*, d) 9 an 
alternative method only addresses the table for capacities it d - w% < d. For example, in 
recursion (3-6) to determine/A, d) for decreasing values of the capacity d = c, c - 
1, . . . , 0, only a single table of length c + 1 is needed because the table/d) for a given 
value of it holds entries flh - 1 , d) for d = 0, . . . , d' and values//*, d) for 
*/ = </'+!,. ..,c. In other words, the recursion is evaluated in-place, which provides 
minimal space consumption. Although only a single table is necessary for storing/A, d) 9 
the table is still written with two arguments, where the first index refers to the current 
stage of the evaluation as defined by (3-5). 

(b) Finding the Solution Vector in a Space-efficient Way 
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A variant of the method finds the solution vector in. a space-efficient way. -After 
reducing the space consumption to a single table of length c + 1, finding the solution 
vector in a straight-forward manner is desirable. It is then possible to re-compute the 
solution vector by solving a number of MCKPE problems. (U. Pferschy, "Dynamic 
Programming Revisited: Improving Knapsack Algorithms", Computing, 63, pp 419-430, 
1999.) For example, assume that for a given capacity c one has found the optimal 
solution z to (3-2). If one knows the weight sum d of the optimal solution when class 
/ = Lk/2j was considered in recursion (3-6), then one is a step closer to finding the 
solution vector. Knowing the weight sum of the optimal solution at the "mid-point" of 
the algorithm, one can split the original problem into two problems, the first being 
defined on the first Lk/2j classes of the problem, the latter problem being defined on the 
remaining classes of the problem: 

[t/2] 

maximize j^^PgXg 

i=1 jeN, 
k 

subjectto Y,T, w iJ x v= d 

^jXy = 1 i = 1, . . . , k, 

jeN, 

x,e{<Ui i = l 9 ...,k 9 jeN i 



and 



i=[*/2j+l jeN, 



maximize > > p .x 



subject to J] J] WgX- =c-d 

E^ =1 1 = 1,..., A:, 

jeN, 

x v <={0,l}, i = l,...,tc,jeN i 



(3-9) 



If one repeats this process recursively, a number of problems are obtained 
considering only a single class, and it becomes easy to determine the optimal solution 
vector of a MCKPE defined on a single class. 
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This approach therefore solves 1 + 2 + 4 + • • • + 2 log2 *=l + 2 + 4 + -- - + 2fc<4fc 
problems. Because the capacity of the problems is decreasing, the time complexity 
becomes the following: The first problem takes O(nc) time to solve. The following two 
subproblems take 0{{nl2)d) and 0((n/2)(c-d)) time respectively, which in total takes 
0{ncl2) time. Using the same observation, the next four subproblems take 0(nc/4) time 
to solve, and in total the following time complexity is obtained: 

'ill ^ 



nc nc nc 
nc + — + — + — + ••• = nc 
2 4 8 



1 + - + -+- + — \<2nc (3-10) 
2 4 8 J ; 



Thus, to derive the optimal solution vector this approach uses twice the time needed to 
find only the optimal solution value. 

10 It is however not obvious to see how one keeps track on the weight sum d of the 

optimal solution when class i = Lk/2 J was considered in recursion (3-6). For this reason 
an auxiliary table g(h, d) is introduced, which for h > Lk/2j holds the weight sum of the 
solution^/*, d) when the class i = Lk/2j is considered. In other words one extends 
recursion (3-6) to the following. For i < Lk/2j one just sets g(h, d) to any value, e.g. zero. 

1 5 For h = Lk/2 J one has the recursion 

f(h 9 d) = f(h -l,d- w Af )+ p hf 

g{h,d) = d (3-11) 
f = arg max yeJV /(A -\d~ w hf )+ p hf 

For h > [k/2] one has 

g{h 9 d) = g(h-l 9 d-w hf ) (3-12) 
/ = arg max MN j(h -\d~ w v )+ p hf 

This recursion ensures that g(Ar, c) holds the weight sum of the optimal solution when 
class i - Lk/2j is considered. 

As noted in the previous section, it is not necessary to save the whole table g(h, it) 
but only to store a single table of length c + 1. 

(c) Evaluating the Recursion for a Limited Range of Capacities 
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The recursion (3-6) should be evaluated for all values of h = 1, . . . , k and d = 
0, . . . , c but actually it is not necessary to consider all capacities. Let for each class i = 
1, . . . , k the minimum and maximum weight in the class be given as 

w. = min w {i w. = max w.. 
Then at stage h of recursion (3-6) the weight sum d cannot be obtained if 



i=l A 

or 



*=1,.. h 



A: 



max 



(d) Preordering of classes 



(3-13) 



0<</< (3^4) 



2>,. <</<c (3-15) 



so that one sets f(h,d) = -co for these values of d. Thus the recursion only needs to be 
evaluated for the remaining capacity values d in the interval 

E ^ * J ^ min 1 Z^> c f (3-16) 

Although this does not change the worst-case time complexity of 0(nc) it can reduce the 
solution times for many practical instances. 

If one wishes to solve the MCKPE problem only for the capacity c (this occurs 
e.g. when we are looking for the solution vector corresponding to the optimal solution), 
even fewer capacities need to be considered in the recursion. In this case, at stage h, one 
does not need to consider capacities d which cannot reach the capacity c even if one 
chooses the heaviest items in all the remaining classes f =h + 1, . . . , k it Thus one need 
only consider capacities where 



d+Y^w^c (3-17) 
which reduces the considered capacities d to 



I X WiiC-JjWit^d <min\ £ w,,c[ (3-18) 
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As we saw in the previous section, it was only necessary to evaluate recursion (3- 
6) for specific values of the capacity d. In order to get the largest benefit of this 
optimization, one should order the classes according to increasing values of the weight 
span Si which for class i is defined as 
5 s^w^w. (3-19) 

A motivation for sorting the classes according to increasing s,- is to limit as long as 
possible the range of capacities that need to be considered in (3-6). A simple example 
may illustrate this approach 
10 Example 1 Assume that one has three classes. given as 

Nj ={(0,0), (9,9)}, N 2 = {(0,0), (3,3), (5,5)}, N 3 = {(0,0), (1,1)}, c= 15 (3-20) 

If one treats the classes as given, one will consider the capacities 0, . . . , 9 in the first 

¥~ iteration, capacities 0, . . . , 14 in the second, and capacities 0, . . . , 15 in the third 

Q 

p 15 iteration. This will in total demand 41 evaluations of (3-6). The weight spans are 9, 5 

] *f and 1 respectively, so if one treats the classes in reverse order, one will need to consider 

Cf capacities 0, . . . , 1 in the first iteration, capacities 0, . . . , 6 in the second, and capacities 

\j 0, . . • , 15 in the third iteration. This will in total demand 25 evaluations of (3-6). 
?_ (e) Dominance of items 

20 A variant of the method uses dominance within each class to remove some of the 

ijj items a-priori, thus improving the solution time of the algorithm. 
Jt Definition 1 Assume that one is solving the MCKPE problem. If two items r and s in the 
same class Nj satisfy that 

w ir = w is and p ir > p is (3-21) 
then one says that item r dominates item s. 
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Obviously an optimal solution exists where no dominated items occur: Assume 
that an optimal solution used item s from class N f which is dominated by item r. Then by 
choosing item r instead of item s one obtains a solution with the same weight sum, and a 
30 not smaller profit sum. Thus one may delete dominated items from each class before 
running recursion (3-6). 
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For the MCKPL variant of the problem, a stronger concept of dominance can be 

used: 

Definition 2 Assume that one is solving the MCKPL problem. If two items r and s in the 
same class Ni satisfy that 

w ir <Wi S andp ir >p is (3_22) 
then one says that item r dominates item s. 

Obviously an optimal solution to MCKPL exists where no dominated items occur, 
and thus one may a-priori delete these items. 

Finally for the MCKPG variant of the problem, one uses the following concept of 
dominance. 

Definition 3 Assume that one is solving the MCKPG problem. If two items r and s in the 
same class N t satisfy that 

w ir >w is andp ir >p is (3_23) 
then one says that item r dominates item s. 

Since an optimal solution to MCKPG exists where no dominated items occur, one 
may delete all dominated items a-priori. 
(f) Loop unrolling 

The recursion (3-6) is easy to implement but it may be quite time-consuming to 
evaluate on modern processors, since it does not utilize caching very well. The problem 
is, that in order to evaluate^ it may be necessary to make several look-ups in the table/ 
at positions^ - w hj ) fovj e N h . Since the values w hj may be scattered the caching 
strategy of the processor will not work very well. 

As a rule of thumb the cache ensures that whenever one makes a look-up in table f 
at position d then also the value of f at positions around d will be available. To use this 
fact, it may be appropriate to evaluate several values of f (d) at the same time. Thus in 
recursion (3-6) we determine 

fih, d),f[h, d+l), .. . ,J(h, d + M- 1) (3-24) 
at the same time for a reasonable value of M, (e.g. 16). Initially one sets all 
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M,d+ l) = -co for i = 0,...,M-l (3-25) 
Then one runs through the values j e N h one by one, setting 

fth, d+l) max {f[h, d + i),i{h-\,d+i- w kJ + p hj ) (3-26) 

As seen this means that the values w hj and p hJ can be stored locally, since they are 
used in a number of calculations, and that the table/is addressed in a linear way. The 
values//*, d + i) should be stored locally to improve performance (and to ensure 
correctness of the algorithm). 

10 

3.4 The MCKP with different capacity constraints 

Having solved MCKPE to optimality for all capacities d = 0, . . . , c it is not 
difficult to solve MCKPL. One simply sets 

g /(M) = jnax f(k,d') ford = 0,...,c (3-27) 

O 15 

j*f which can be done in time O(c) if one considers the capacities it in increasing order, 

iO since then one has 

H f(k,d) = max{/XM-l), f(k,d)} (3-28) 

1^20 forJ=l,...,c. 

I'll 

f! To find the solution vector corresponding to the optimal solution value of 

□ MCKPL, it is necessary to determine for which capacity d* the largest profit sum is 

obtamed. This can be done by keeping track on the most promising value of d* as part of 
(3-28). The optimal solution vector is then found as for MCKPE, where d* is the 
25 capacity. 

In a similar way one may also solve the MCKPG problem. It is necessary 
however to demand that the maximum capacity c used in recursion (3-6) is at least as 
large as the maximum obtainable weight sum. If this is the case then the tabled, d) 
holds the optimal solution value of MCKPE for all obtainable weight sums d. In this case 
30 one may set 

f(k,d)=raax c f(k,d') ford = 0,...,c (3-29) 
24 
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which can be evaluated efficiently by considering the capacities d in decreasing order 
setting 

f(k,d) = max{f(k,d), f(k,d + \)} (3 _30) 

for d = c- 1, . . . , d. 

In view of the requirement that c is larger than the total weight sum obtainable, 
the solution vector for MCKPG will correspond to the best solution value obtainable for 
this value of c. The arguments to the algorithm do not make it possible to find a solution 
vector for a smaller value of c. 



Computational experiments and summary 

The worst-case solution time of the method presented above is 0(nc) where n is 
the total number of items, and c the maximum capacity for which the solution should be 
g derived. To evaluate the practical performance of the algorithm and to uncover the 

constants involved in the term 0(nc), a number of test instances with varying problem 
size and data-range were executed. Three types of randomly generated data instances 
were considered, for a different number of classes k and size «,-. The data-range R is a 
constant stating the magnitude of the parameters generated. 

• Uncorrelated data instances (UC): In each class we generate m items by choosing w & 
20 and p i} randomly in [ 1 , R] . 

• Weakly correlated data instances (WC): In each class, Wy is randomly distributed in 
[1, R] andpij is randomly distributed in [wy - 10, wy + 10], such that p 8 > 1. 

• Subset-sum data instances (SS): wy randomly distributed in [1, R] and p i} = wy 

The currently biggest practical problems solved have 500 classes with 10 items 
25 per class, so a number of experiments with instances similar to this configuration were 
run testing varying data-range R and capacity c. The solution times are reported for tests 
run on a Intel Pentium 1 1 550 MHz computer with 64 Mb RAM. The internal space 
consumption of the algorithm is 12n+8c bytes, where it is assumed that a long integer 
takes 4 bytes (the return array for the histogram is not counted as part of the internal 
30 space consumption, but it takes Cc bytes). Thus for the largest instance considered in this 
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Section, having k = 500, w, = 20 and c = 1000000 the space consumption was 12 * 10000 
+ 8 * 1000000 = 8120000 bytes, a reasonable space consumption. 

Tables 1 to 4 report the solution times for capacities c = 10000, 100000 and 
1000000, class sizes n t = 5, 10 and 20, data-range R = 100, 1000 and 10000. 

Tables 1 and 2 report the running times for deriving the histogram of all optimal 
solution for capacities 0, . . . , c. The first table is for the MCKPE problem, the second 
table is for the MCKPL problem. It is seen that most of the solution times, as indicated in 
seconds, are very reasonable, making the algorithm applicable for interactive planning 
even for large instances. 
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Table 2: Running times for finding histogram to MCKPL, with an average of 10 

instances. 
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Table 3: Running times for finding optimal solution to MCKPE, with an average of 10 

instances. 
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Table 4: Running times for finding optimal solution to MCKPL, with an average of 10 

instances. 



Tables 3 and 4 report the running times for deriving the optimal solution vector 
corresponding to the capacity c. Table 3 considers the MCKPE variant of the problem, 
and Table 4 considers the MCKPL variant. In an embodiment, it takes about twice as 
long to find the optimal solution vector than to find the histogram. This corresponds well 
to the observation of equation (3-10). Any dynamic programming algorithm can have 
some overhead for finding the corresponding solution vector. 

An exemplary algorithm for the Multiple-choice knapsack problem has been 
presented above. The algorithm can vary by allowing the user to specify different 
relations in the capacity constraint, and by solving the problem for all capacities 0, . . . , c. 



The latter property makes the algorithm very applicable for interactive decision making, 
since it gives the decision maker a complete overview over the problem. 

Compared to simple dynamic programming recursion the improved algorithm 
decreases the practical solution times by a factor 5-20. Moreover, the space consumption 
5 has been decreased from 0(nc) to 0(c) at the cost of a small increase in the solution time. 
Without this latter improvement, it would not have been possible to solve certain large 
problems. A straightforward use of recursion equation (3-6) for the instance with n = 1 0 
000 and c = 1000 000 would have demanded approximately 40 Gb processing space. 
Instead, the algorithm can be run using, for example, only 8 Mb processing space. A 
1 0 final benefit of the algorithm is that it can be adapted for thorough error handling which 
should make it very robust against break-downs inside the corresponding code. 

M 4. Additional Embodiments 

□ Figure 6 shows an exemplary flowchart of an assortment optimization 

jj 15 embodiment of the invention, showing further detail than the embodiment shown in 
*f Figure 2. This embodiment is capable of solving either physically constrained problems, 
M such as determining the optimal number of facings of each item to include in a fixed 
M> amount of shelf width, or economically constrained problems, such as the way in which a 
jj^ buyer of fashion items can optimally allocate their spending budget among a set of 
yj 20 candidate fashion items. The sample data and calculations that appear in Figures 7-13 
r? illustrate the use of this embodiment in its economically constrained mode only. For this 
embodiment, values for the profit data are read from the tables shown in Figures 7-10, the 
profit functions and cost functions are shown in Figure 1 1, the cost constraint is shown in 
Figure 12, and solution values for the allocations are shown in Figure 13. In step 1400, 
25 data is input, including from an item attribute table, an item demand table, a system 
attributes table, and a season attributes table, such as the exemplary tables shown in 
Figures 7-10. In step 1402, the optimizer selects the first item from the input data and 
determines the maximum order quantity in step 1404. 

In step 1406, the optimizer determines whether the item is mandatory. If it is 
30 mandatory, step 1412 sets the first quantity choice equal to the minimum order quantity. 
If the item is not mandatory, at step 1408, the first quantity choice is set to 0. The profit 
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model in this exemplary embodiment uses a simulation to qalculate the profitability for 
any given amount of an item that the buyer purchases from the manufacturer. The profit, 
in this example, is the gross profit, which is defined as the difference between the sales 
income and the inventory cost (cf. Eq. (2-1)). 

In step 1414, the profit model is used to calculate the profitability of current 
quantity choices. At step 1416, if the current quantity is greater than or equal to the 
maximum order quantity, step 1418 checks whether this is the final item from the input 
data. If it is not the final item, the next item is selected in step 1410, and the maximum 
order quantity is determined in step 1404. At step 1416, if the current quantity is not 
greater than or equal to the maximum order quantity, then at step 1420, the optimizer 
checks to determine if the current quantity for the item is 0. If the current quantity is 0, 
then at step 1422, the quantity is incremented by the minimum order quantity. 

At step 1420, if the current quantity is not 0, then the optimizer increments the 
quantity by the incremental order quantity at step 1426. After the incrementation in step 
1422 or step 1426, the data is passed back to the profit model at step 1414 to determine 
profitability of the current quantity choice. 

At step 1418, if the final item profitability has been calculated, then at step 1424, 
the intermediate data, such as the data in the exemplary table shown in Figure 1 1 , is 
generated. At step 1430, the optimizer takes the constraint data from step 1428, such as 
the exemplary constraint shown in the table in Figure 12, and finds the maximum 
constrained profit. 

At step 1432, the output data showing the optimal solution generated by the 
optimizer is communicated to, for example, a display. An exemplary output table is 
shown in Figure 13. 

As discussed above, values for the profit data are read from the tables shown in 
Figures 7-10. Figure 7 shows an exemplary item attribute table 1500 associated with this 
embodiment of the invention. Exemplary include field 1502 value of "1" indicates that 
the item should be included in the optimization. Otherwise "0" is entered. Item ID 1504 
is an Item identifier. Item Description 1506 includes a text description of the item. Item 
Type 1508 "F" or "f ' indicates a fashion item, and "R" or "r" indicates a replenishment 
item. Total Lifecycle 1510 indicates the number of weeks the item is expected to be 
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selling in the stores, along with the weeks between the in-store date and the out-of-stock 
date. 

Cost 1512 indicates the unit cost of the associated item. Width 1514 indicates the 
item width for the item. In an embodiment, if the item is apparel, a numeric placeholder 
5 such as " 1 " can be entered. Height 1 5 1 6 indicates an item height, with a numeric 

placeholder entered for apparel. Capacity 1518 indicates facing capacity, or the number 
of units that fit on a shelf. A numeric placeholder may also be entered for capacity. 

Mandatory 1520 may contain True ("T" or "t") to indicate that the item must be 
part of the assortment, or False ("F" or "f ') to indicate that the item is optional. For 
10 Estimate Type 1522, "D" or "d" can indicate that the estimate entered in the Item 
Demand text file is a representation of demand rather than historical sales. 

Resupply factor 1 524 is a sample field for which a placeholder may be used. 
MOQ 1526 indicates minimum order quantity, such as the minimum number of units that 
must be bought if the item is included in the assortment. The MOQ 1526 may be 
1 5 determined by factors such as vendor requirements, fixture fill quantities, and other 
factors. The MOQ 1526 may be 0 if there is no minimum requirement. 

Incr Order Qty 1528 indicates that above the MOQ 1526, buy quantities must be 
in increments of the number of units entered. Case packs or number of stores may be 
factors in determining this number. If buys may be placed for any number of units, 1 can 
20 be entered. 

Min Shelf Presence 1530 indicates the number of units on the floor that will 
trigger the replenishment system to place an order. A numeric placeholder can be entered 
for fashion items. Holding Cost 1532 indicates, for example, the cost per unit per week 
of holding inventory in a warehouse or backroom. 

25 Vendor Lead Time 1534 indicates a number of weeks from the point the 

replenishment system orders an item until the item reaches the store floor. Start Sell Date 
1536 indicates the date the item will be on the floor in a significant number of stores. 
Nbr Of Stores 1538 indicates the number of stores that will carry the item. Pack Size 
1540 indicates, for example, the number units shipped in one factory pack. 

30 Figure 8 shows an exemplary item demand and price table 1 600 associated with 

this embodiment of the invention. Item ID 1504 indicates the item identifier. In an 
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exemplary embodiment, Entry -Type 1602 may include one f of three parameters for each 
item: 

DEMAND - Specifies that the "Week" column contains the regular (not 
promotional) demand. 

5 PROMO - Specifies that the "Week" column contains the demand that occurs 

during a promotional period. 

PRICE - Specifies that the "Week" column contains the unit price. 
Nbr. Of Entries 1604 specifies the number of "Week" columns that will be 
defined. This field allows the user to define profiles in which regular demand, 
1 0 promotional demand, and price fluctuate over time. In this example, a " 1 " indicates that 
average demand and price are constant over the life cycle of the item. 

Week 1 1606 indicates the value of Entry Type 1602 during week 1. Week 2 
1608 indicates the value of Entry Type 1602 during week 2. Week 3 1610 represents the 
g value of Entry Type 1602 during week 3. In this example a half year life cycle is used, 
»f 15 ending at Week 26 1612, which shows the value of Entry Type 1602 during week 26. 
€f Figure 9 shows an exemplary system attribute table 1 700 associated with this 

y embodiment of the invention. Periods of Supply 1702 indicates the target weeks of 
supply to maintain in the stores. Periods per Window 1704 indicates the number of 
f y weeks that comprise the sliding of the window used to determine average demand. 
jTj 20 Number of Simulation Trials 1 706 indicates the number of simulations the profit 

model will run. Higher numbers should lead to smaller margins of error, but longer run 
times. 

Interest Rate 1708 indicates the interest rate the company could attain if the 
money invested in the assortment is invested elsewhere. Date the Buys Will be Placed 
25 1710 indicates the date to which the present value of money will be calculated. 

Operating Hours per Period 1712 indicates the number of hours that the store is open for 
business in one time period. 

Stock Clerk Success Rate 1714 indicates the probability that a stock clerk will 
replenish an item when the on-shelf supply of the item is depleted to the trigger value. 
30 Stock Clerk Cycle Time 1716 indicates the elapsed time between the stock clerk's visits 
to the item location. Stock Clerk Trigger Value 1718 indicates that if the stock clerk 
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finds this number of units of an item on the shelf, the clerk should attempt to replenish 
the shelf. 

Figure 10 shows an exemplary season attributes table 1800 associated with this 
embodiment of the invention. Season Start Date 1802 represents the start of the time for 
5 which the buyer is selecting the assortment. For example, the buyer may be selecting a 
summer assortment that will be implemented on February 1 . Individual items have their 
own Start Sell Dates, for example field 1536 in Figure 7, that are independent of the 
Season Start Date 1802. Referring again to Figure 10, Season End Date 1804 indicates 
the end of the time for which the buyer is selecting the assortment. 

10 As discussed above, the profit functions and cost functions are shown in Figure 

11, which shows an exemplary item choice table 1900 associated with an embodiment of 
the invention. In an embodiment, the MCKP portion of the optimizer engine processes 
this table and chooses a maximum profit mixture of items and their quantities that does 
not exceed the cost constraint. Item ID 1504 indicates the item identifier. Item 

1 5 description 1 506 includes a text description of the item. Choice 1 902 indicates an index 
value corresponding to the quantity of an item. Index 1904 indicates an index value 
corresponding to an individual item. 

Profit 1906 indicates the profit earned if the number of units corresponding to 
choice 1902 is purchased. Cost 1908 indicates the amount that it costs to purchase the 

20 number of units corresponding to choice 1902. Units 1910 indicates the number of units 
of the item corresponding to choice 1902. 

As discussed above, the cost constraint is shown in Figure 12, which shows an 
exemplary constraint table 2000 associated with an embodiment of the invention. 
Constraint 2002 indicates the limit on the money that can be spent on the assortment, for 

25 example $150,000. In the final assortment selection, the sum of all of the "Cost" values, 
such as from field 1908 in Figure 19, must not exceed Constraint 2002. 

To illustrate this embodiment in greater detail, specifically the way in which the 
selected numbers in Figure 1 1 were generated, the following sample calculations are 
provided for Item 41 1, which corresponds to the first row in Figure 7. From the first row 

30 of Figure 8, the demand for Item 411 is 2390. From Figure 7 the total lifecycle (1510) of 
Item 41 1 is 26 weeks. Since from Figure 8 the number of entries (1604) is 1, the demand 
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* 

is understood to be uniformly spread across the 26 weeks of the item's lifecycle, making 
the weekly demand equal 2390/26 = 91 .92. Since from Figure 7 the minimum order 
quantity (MOQ 1526) for Item 41 1 is 1200, the first row of Figure 1 1 has 1200 in the 
column for units 1910. In subsequent rows of Figure 1 1, the units 1910 are incremented 
5 in steps of 48, the value given for the incremental order quantity 1528 in Figure 7. 

The highest number of units (1910) considered in Figure 1 1 for Item 41 1 is 3552. 
To realistically bound the number of units considered, a bound on the total demand can 
be used. For the example shown here, the largest number of units considered in the 
distribution of Item 41 1 was taken as 3552, a value that is approximately 150% of the 
10 total demand (i.e., 1.5 *2390 =3585 from the first row of Figure 8) and a multiple of the 
incremental order quantity (i.e., 3552/48=74 from the first row of Figure 7). For each of 
the number of units considered, the profit model calculates a cost (1908) and profit 
y= (1906). As stated above, this embodiment contains a profit model that is based on a 

g simulation. Demand is simulated throughout the item's lifecycle in weekly increments, 

jjj* 15 Since Item 41 1 has a 26-week lifecycle, the simulation generates 26 random numbers, 
yg each of which represent the demand during one week of this item's lifecycle and each of 
si which is shown under the weekly demand column of Table 5. As described above, a 

J ^ variety of distribution models (e.g., Poisson)may be used as the underlying distribution 

fy model for the demand. For This embodiment a normal distribution, which is 
yj 20 characterized by a mean and a standard deviation, was used. Here the mean was taken to 
be 91.92, estimated weekly demand as described above. The standard deviation was set 
as 28.84, based on an analysis of historical data. 

The cumulative demand in Table 5 is calculated by summing the weekly demand 
from all earlier weeks and is equal to the number of units of Item 41 1 that are sold when 
25 the supply is at least at that level. That is, the supply of units is the upper limit on the 
sales of the item. The cumulative sales column in Table 5 shows the cumulative number 
of units sold in all of the preceding weeks and is likewise limited by the supply. Only two 
examples of supply are shown - 1200 and 3552 units - but the simulation performs 
similar calculations for each number of units (1910) of Item 411 that are shown in Figure 
30 11. The cost, or inventory cost, is calculated by multiplying the total units sold by the 
cost of a single unit. The sales income is calculated by multiplying the total units sold by 
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the sales price of a single unit. The profit is then calculated by taking the difference 
between the sales income and the inventory cost. From Figure 7 the unit cost (1512) of 
Item 41 1 is $14.12, and from Figure 8 (row 3)the unit price is $24.00. 
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Total Units Sold 


1200 


2329 


Sales Income 


$28,800.00 


$52,704.00 


Inventory Cost 


$16,944.00 


$50,154.24 


Profit 


$11,856.00 


$ 2,549.76 



Table 5: Demand distribution values for Item 41 1. 

Both the consistency and the accuracy of the simulation increase when the user 
runs it repeatedly and then combines the results in some way such as averaging. Figure 9 
shows that the number of simulation trials (1706) for this calculation has been set to 10. 
The sample data in Table 5 illustrates the way in which the simulation works, but does so 
only for 1 of the 10 simulation trials. For this reason, the sample calculations are only 
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approximately equal to the cost and profit shown in Figure .1 1 . For example, the Table 5 
shows a profit of $2549.76 when the units equal 3552, whereas Figure 1 1 shows a profit 
of $6867.36 for the same number of units. In order to make all of the numbers match 
exactly, all of the numbers, all 10 simulations would need to be combined. 

As discussed above, solution values for the allocations are shown in Figure 13, 
which shows an exemplary optimal solution table 2100 associated with an embodiment of 
the invention. Item ID 1504 indicates the item identifier. Item description 1506 includes 
a text description of the item. Choice 2102 indicates an index value corresponding to the 
optimal quantity of an item. Index 2104 indicates an index value corresponding to an 
individual item. 

Units 2110 indicates the optimal number of units of the item. Profit 2106 
indicates the profit earned if the optimal number of units 21 10 is purchased. Cost 2108 
indicates the amount that it costs to purchase the optimal number of units 2110. 

As illustrated by the embodiment shown in Figures 7-13, the present invention 
enables stochastic optimization of allocations in a wide variety of operational settings 
including many practical business applications. 

Although the invention has been described in conjunction with particular 
embodiments, it will be appreciated that various modifications and alterations may be 
made by those skilled in the art without departing from the spirit and scope of the 
invention. The invention is not to be limited by the foregoing illustrative details, but 
rather is to be defined by the appended claims. 
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